Expert system for clustering prokaryotic species by their metabolic features
نویسندگان
چکیده
Studying the communities of microbial species is highly important since many natural and artificial processes are mediated by groups of microbes rather than by single entities. One way of studying them is the search of common metabolic characteristics among microbial species, which is not only a potential measure for the differentiation and classification of closely-related organisms but also their study allows the finding of common functional properties that may describe the way of life of entire organisms or species. In this work we propose an expert system (ES), making the main contribution, to cluster a complex data set of 365 prokaryotic species by 114 metabolic features, information which may be incomplete for some species. Inspired on the human expert reasoning and based on hierarchical clustering strategies, our proposed ES estimates the optimal number of clusters adequate to divide the dataset and afterwards it starts an iterative process of clustering, based on the Self-organizing Maps (SOM) approach, where it finds relevant clusters at different steps by means of a new validity index inspired on the well-known Davies Bouldin (DB) index. In order to monitor the process and assess the behavior of the ES the partition obtained at each step is validated with the DB validity index. The resulting clusters prove that the use of metabolic features combined with the ES is able to handle a complex dataset that can help in the extraction of underlying information, gaining advantage over other existing approaches, that may relate metabolism with phenotypic, environmental or evolutionary characteristics in prokaryotic species. 2013 Elsevier Ltd. All rights reserved.
منابع مشابه
Study of the metabolic profile of Papaver extracts by chromatographic and chemometrics methods
Background and objectives: Chromatography fingerprinting is considered as a comprehensive method for quality control, diagnosis and the nature of herbal drugs, and it is important to classify the different samples of medicinal plants and determine the chemical species present in them. Methods: In this research, a new strategy based on the combination of multiva...
متن کاملClassification of encrypted traffic for applications based on statistical features
Traffic classification plays an important role in many aspects of network management such as identifying type of the transferred data, detection of malware applications, applying policies to restrict network accesses and so on. Basic methods in this field were using some obvious traffic features like port number and protocol type to classify the traffic type. However, recent changes in applicat...
متن کاملUse of metabolomics for the chemotaxonomy of legume-associated Ascochyta and allied genera
Chemotaxonomy and the comparative analysis of metabolic features of fungi have the potential to provide valuable information relating to ecology and evolution, but have not been fully explored in fungal biology. Here, we investigated the chemical diversity of legume-associated Ascochyta and Phoma species and the possible use of a metabolomics approach using liquid chromatography-mass spectromet...
متن کاملThe Origins of Ecological Diversity in Prokaryotes
The urkingdoms and major divisions of prokaryotes are enormously diverse in their metabolic capabilities and membrane architectures. These ancient differences likely have a strong influence on the kinds of ecological adaptations that may evolve today. Some ecological transitions have been identified as having occurred primarily in the distant past, including transitions between saline and non-s...
متن کاملProviding a Fuzzy Expert System to Assess the Maturity Level of Companies in Manufacturing Excellence in the Food Industry of Iran
This study seeks to develop a fuzzy expert system to help managers in assessing their effectiveness and position of their business on the manufacturing excellence track. Assessment process is multi-dimensional in nature and there is a relationship between the different variables of the system. In addition, both quantitative and qualitative variables as well as the uncertainty in the statements ...
متن کاملذخیره در منابع من
با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید
برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید
ثبت ناماگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید
ورودعنوان ژورنال:
- Expert Syst. Appl.
دوره 40 شماره
صفحات -
تاریخ انتشار 2013